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(54) Identification of a novel cysteine-rich cell penetrating peptide 



(57) The present invention relates to a nucleic acid 
molecule encoding a peptide capable of being internal- 
ized into a cell, wherein said nucleic acid molecule com- 
prises (a) a nucleic acid molecule encoding a peptide 
having the amino acid sequence of SEQ ID NO: 2; (b) a 
nucleic acid molecule having the DNA sequence of SEQ 
ID NO: 1 , wherein T is U if the nucleic acid molecule is 
RNA; (c) a nucleic acid molecule hybridizing under strin- 
gent conditions to the complementary strand of a nucleic 
acid molecule of (a) or (b), wherein the peptide encoded 
by said nucleic acid molecule has a cysteine at least at 
two positions selected from the group consisting of po- 
sitions 1 , 7 and 8 of SEQ ID NO: 2 and an arginine or a 
lysine at least at four positions selected from the groups 
consisting of position 2, 4, 6, 9 or 10 of SEQ ID NO: 2; 
(d) a nucleic acid molecule encoding a peptide having at 
least 70% sequence identity with that of SEQ ID NO: 2, 
wherein at least at two positions selected from the group 
consisting of positions 1, 7 and 8 of SEQ ID NO: 2 a 
cysteine is present and wherein at least at four positions 

< 

oo 

LO 

o 

O) 

o 

CM 

CL 
LU 



selected from the groups consisting of position 2, 4, 6, 9 
or 1 0 of SEQ ID NO: 2 an arginine or a lysine is present; 
or (e) a nucleic acid molecule degenerate with respect 
to the nucleic acid molecule of (c) or (d). The present 
invention also relates to a peptide encoded by the nucleic 
acid of the invention, a fusion molecule comprising the 
peptide of the invention and a composition comprising 
the peptide or the fusion molecule of the invention. Fur- 
thermore, the present invention relates to a method of 
detecting the internalization behaviour of a fusion mole- 
cule of the invention, the composition of the invention for 
treating and/or preventing a condition selected from can- 
cer, enzyme deficiency diseases, infarcts, cerebral 
ischemia, diabetes, inflammatory diseases, infections 
such as bacterial, viral or fungal infections, autoimmune 
diseases such as systemic lupus erythematodes (SLE) 
or rheumatoid arthritis, diseases with amyloid-like fibrils 
such as Alzheimer's disease (AD) and Parkinson's dis- 
ease (PD) or certain forms of myopathy. 
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Description 

[0001] The present invention relates to a nucleic acid molecule encoding a peptide capable of being internalized into 
a cell, wherein said nucleic acid molecule comprises (a) a nucleic acid molecule encoding a peptide having the amino 

5 acid sequence of SEQ ID NO: 2; (b) a nucleic acid molecule having the DNA sequence of SEQ ID NO: 1 , wherein T is 
U if the nucleic acid molecule is RNA; (c) a nucleic acid molecule hybridizing under stringent conditions to the comple- 
mentary strand of a nucleic acid molecule of (a) or (b), wherein the peptide encoded by said nucleic acid molecule has 
a cysteine at least at two positions selected from the group consisting of positions 1, 7 and 8 of SEQ ID NO: 2 and an 
arginine or a lysine at least at four positions selected from the group consisting of positions 2, 4, 6, 9 or 1 0 of SEQ ID 

10 NO: 2; (d) a nucleic acid molecule encoding a peptide having at least 70% sequence identity with that of SEQ ID NO: 
2, wherein at least at two positions selected from the group consisting of positions 1 , 7 and 8 of SEQ ID NO: 2 a cysteine 
is present and wherein at least at four positions selected from the group consisting of positions 2, 4, 6, 9 or 1 0 of SEQ 
ID NO: 2 an arginine or a lysine is present; or (e) a nucleic acid molecule degenerate with respect to the nucleic acid 
molecule of (c) or (d). The present invention also relates to a peptide encoded by the nucleic acid of the invention, a 

is fusion molecule comprising the peptide of the invention and a composition comprising the peptide or the fusion molecule 
of the invention. Furthermore, the present invention relates to a method of detecting the internalization behaviour of a 
fusion molecule of the invention, the composition of the invention for treating and/or preventing a condition selected from 
cancer, enzyme deficiency diseases, infarcts, cerebral ischemia, diabetes, inflammatory diseases, infections such as 
bacterial, viral or fungal infections, autoimmune diseases such as systemic lupus erythematodes (SLE) or rheumatoid 

20 arthritis, diseases with amyloid-like fibrils such as Alzheimer's disease (AD) and Parkinson's disease (PD) or certain 
forms of myopathy. 

[0002] In this specification, a number of documents are cited. The disclosure content of these documents including 
manufacturers' manuals is herewith incorporated by reference in its entirety. 

The targeted delivery of substances to cells has long been hampered by the cell membrane being an efficient protective 
25 wall to exclude most molecules that are not actively imported by living cells. Only a narrow range of molecules of certain 
molecular weight, polarity and net charge is able to diffuse through cell membranes. Other molecules have to be actively 
transported by e.g. receptor- mediated endocytosis or artificially forced through the cell membrane by methods such as 
electroporation, cationic lipids/liposomes, micro-injection, viral delivery, encapsulation in polymers . These methods are 
mainly utilized to deliver hydrophobic molecules. Furthermore, the side effects associated with these methods and the 
30 fact that their utilization is limited to in vitro uses has prevented them from becoming an efficient means to deliver 
substances such as drugs to the cell in order to treat diseases and conditions. 

The discovery of cell-penetrating peptides (CPPs) also called protein transduction domains (PTDs) or membrane trans- 
location sequences (MTS) proved that the translocation of larger molecules through the cell membrane is possible. 
Prominent examples of CPPs are the HIV-1 TAT translocation domain (Green and Loewenstein, 1988) and the home- 
35 odomain of the Antennapedia protein from Drosophila (Joliot et al., 1991). The exact translocation mechanism is still 
disputed. Mutation studies of the Antennapedia protein revealed that a sequence of 1 6 amino acids called penetratin or 
pAntp (Derossi etal., 1994) is necessary and sufficientfor membrane translocation. Inthefollowing, other protein-derived 
CPPs were developed such as the basic sequence of the HIV-1 Tat protein (Vives et al., 1 997) and the chimeric peptide 
transportan (Pooga et al., 1 998). A synthetic peptide developed is the amphipathic model peptide (Oehlke et al., 1 998). 
Coupling of antisense DNA or PNAs to CPPs was shown to exert the desired effect in vivo. 

It was long questioned which features were necessary for a CPP to exert the translocation function. In general, little 
structural resemblance has been found between the different families of CPPs. So far the only consistently found feature 
is the high content of basic amino acids resulting in a positive net charge. Thus, it is assumed that CPPs initially bind to 
negatively charged head groups of lipids or proteins in the cell membrane. In this regard, the importance of arginine as 
45 positive amino acid was demonstrated by several groups (Rothbard et al., 2000; Wender et al., 2000). Generally, an 
alpha-helical secondary structure has been predicted for CPPs which could be verified for some cases but cannot be 
taken as a general prerequisite. 

Many proteins able to translocate have severe side-effects on the cell, which is understandable in view of the fact that 
most of the naturally occurring substances are used as e.g. antimicrobial substances or toxins. CPPs can e.g. cause 

50 cytoplasmic leakage due to membrane disruption and also interfere with the functioning of membrane proteins. CPPs 
might also exhibit cellular toxic effects, such as e.g. transportan which affects GTPase activity (Soomets et al., 2000). 
Furthermore, it becomes more and more clear that many CPPs only exert their function under certain very narrow 
conditions which cannot be met in vivo. Another drawback is that, depending on the target cell, the CPPs may be rapidly 
degraded in the cells. Lastly, toxic and immunogenic effects of CPPs have been observed which prevent their utilization 

55 e.g. in therapeutic applications. 

[0003] Crotamine is one of the main toxins in the venom of the South American rattlesnake (Radis-Baptista et al., 
1999) and shows high homology with other venom myotoxins. The 42 amino acid long cationic polypeptide contains 1 1 
basic residues and six cysteines giving rise to three disulfide bonds. It has two putative NLS motifs, Crot 2 . 18 and Crot 2 7_3 9 . 
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Crotamine was shown to be a CPP penetrating into different cell types and mouse blastocysts in vitro (Kerkis et al., 
2004). It was shown to be non-toxic to a concentration of up to 1 fxM and to localize preferably in the nucleus where it 
is supposed to bind to chromatin structures. When applied before cell division, crotamine is mainly localized in the 
cytoplasm after the telophase. Up to now and depending on the mechanism of internalization, known CPPs mainly 
5 localize in the nucleus or, in case they are internalized in vesicles, remain there and only a small part is released into 
the cytoplasm. 

[0004] Thus, there is a need for CPPs with improved properties. 

[0005] Accordingly, the present invention relates to a nucleic acid molecule encoding a peptide capable of being 
internalized into a cell, wherein said nucleic acid molecule comprises (a) a nucleic acid molecule encoding a peptide 

10 having the amino acid sequence of SEQ ID NO: 2; (b) a nucleic acid molecule having the DNA sequence of SEQ ID 
NO: 1 , wherein T is U if the nucleic acid molecule is RNA; (c) a nucleic acid molecule hybridizing under stringent conditions 
to the complementary strand of a nucleic acid molecule of (a) or (b), wherein the peptide encoded by said nucleic acid 
molecule has a cysteine at least at two positions selected from the group consisting of positions 1, 7 and 8 of SEQ ID 
NO: 2 and an arginine or a lysine at least at four positions selected from the group consisting of positions 2, 4, 6, 9 or 

15 1 0 of SEQ ID NO: 2; (d) a nucleic acid molecule encoding a peptide having at least 70% sequence identity with that of 
SEQ ID NO: 2, wherein at least at two positions selected from the group consisting of positions 1, 7 and 8 of SEQ ID 
NO: 2 a cysteine is present and wherein at least at four positions selected from the group consisting of positions 2, 4, 
6, 9 or 1 0 of SEQ ID NO: 2 an arginine or a lysine is present; or (e) a nucleic acid molecule degenerate with respect to 
the nucleic acid molecule of (c) or (d). 

20 [0006] The term "nucleic acid molecule" as used interchangeably with the term "polynucleotide", in accordance with 
the present invention, includes DNA, such as cDNA or genomic DNA, and RNA. If the nucleic acid molecule is RNA, 
thymine (T) bases denoted in e.g. SEQ ID NO: 1 are replaced with uracil (U), the thymine analogue occurring in RNA. 
Further included are nucleic acid mimicking molecules known in the art such as synthetic or semi-synthetic derivatives 
of DNA or RNA and mixed polymers. Such nucleic acid mimicking molecules or nucleic acid derivatives according to 

25 the invention include phosphorothioate nucleic acid, phosphoramidate nucleic acid, 2'-0-methoxyethyl ribonucleic acid, 
morpholino nucleic acid, hexitol nucleic acid (HNA) and locked nucleic acid (LNA) (see Braasch and Corey, Chem Biol 
2001, 8: 1). LNA is an RNA derivative in which the ribose ring is constrained by a methylene linkage between the 2'- 
oxygen and the 4'-carbon. They may contain additional non-natural or derivative nucleotide bases, as will be readily 
appreciated by those skilled in the art. For the purposes of the present invention, also a peptide nucleic acid (PNA) can 

30 be applied. Peptide nucleic acids have a backbone composed of repeating N-(2-aminoethyl)-glycine units linked by 
peptide bonds. The purine and pyrimidine bases are linked to the backbone by methylene carbonyl bonds. 
In a preferred embodiment, the nucleic acid molecule is DNA. 

[0007] The term "peptide" as used herein describes linear molecular chains of amino acids, including fragments of 
single chain proteins, containing up to 30 amino acids. Peptides may form oligomers consisting of at least two identical 

35 or different molecules. The corresponding higher order structures of such multimers are, correspondingly, termed homo- 
orheterodimers, homo- or heterotrimers etc.. The term "peptide" furthermore comprises peptidomimetics of such peptides 
where amino acid(s) and/or peptide bond(s) have been replaced by functional analogues. Such functional analogues 
also include all known amino acids other than the 20 gene-encoded amino acids, such as selenocysteine. 
A polypeptide as used in the context of the present invention contains more than 30 amino acids. In accordance with 

^o the invention, the term is interchangeably used with "protein". 

The terms "peptide" and "polypeptide" also refer to naturally modified peptides and polypeptides where the modification 
is effected e.g. by glycosylation, acetylation, phosphorylation and similar modifications which are well known in the art. 
[0008] The term "capable of being internalized" as used in the context of the present invention refers to the ability of 
some peptides to pass the plasma membrane of cells. Different mechanisms of internalization are proposed in the 

45 literature: an energy-dependent endocytotic mechanism and an energy-independent passive transport mechanism. The 
latter can be further divided into several suggested models. In the inverted micelle-driven delivery model, the positively 
charged part of the CPP interacts with the phospholipids in the membrane, followed by the interaction of the hydrophobic 
part of the peptide with the membrane, creating the inverted micelle. Another model suggests direct penetration of the 
plasma membrane. It was suggested by example of the TAT peptide that the mechanism of translocation depends on 

50 the cargo attached to the peptide. Size may play a role as well as the chemical properties of the cargo. Furthermore, it 
was shown that the mechanisms may vary depending on the concentration of CPP. For a recent review see e.g. Trehin 
and Merkle (2004), Magzoub and Graslund (2004) or Gupta et al. (2005). In the context of the present invention, any 
possible mechanism of internalization is envisaged. A preferred mechanism would ensure that at least a part, preferably 
more than 30%, more preferably more than 40%, even more preferably more than 50%, even more preferably more 

55 than 60%, even more preferably more than 70% and most preferably more than 80% of the CPP conjugate localizes in 
the cytoplasm in contrast to localization in different compartments, e.g. in vesicles, endosomes or in the nucleus. 
[0009] The term "hybridizes/hybridizing" as used herein refers to a pairing of a nucleic acid molecule to a (partially) 
complementary strand of this nucleic acid molecule which thereby forms a hybrid. 
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[001 0] It is well known in the art howto perform hybridization experiments with nucleic acid molecules. Correspondingly, 
the person skilled in the art knows what hybridization conditions she/he has to use to allow for a successful hybridization 
in accordance with item (c), above. The establishment of suitable hybridization conditions is referred to in standard text 
books such as Sambrook, Russell "Molecular Cloning, A Laboratory Manual", Cold Spring Harbor Laboratory, N.Y. 
5 (2001); Ausubel, "Current Protocols in Molecular Biology", Green Publishing Associates and Wiley Interscience, N.Y. 
(2002), or Higgins and Hames (Eds.) "Nucleic acid hybridization, a practical approach" IRL Press Oxford, Washington 
DC, (1985). 

[0011] "Stringent conditions" refers to hybridization conditions under which the polynucleotides that are capable of 
hybridizing to the polynucleotides of the invention or parts thereof do not cross hybridize to unrelated polynucleotides. 

10 Appropriate stringent hybridization conditions for each nucleic acid sequence may be established by a person skilled in 
the art on well-known parameters such as temperature, composition of the nucleic acid molecules (GC-content), salt 
conditions etc.; see, for example, Sambrook et al., "Molecular Cloning, A Laboratory Manual"; CSH Press, Cold Spring 
Harbor, 1989 or Higgins and Hames (eds.), loc. cit., see in particular the chapter "Hybridization Strategy" by Britten & 
Davidson, 3 to 15. Such conditions comprise e.g. an overnight incubation at 65°C in 4x SSC (600 mM NaCI, 60 mM 

is sodium citrate) followed by washing at 65°C in 0. 1 x SSC for one hour. Alternatively, hybridization conditions can comprise: 
an overnight incubation at 42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCI, 75 mM sodium citrate), 
50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulphate, and 20 fxg/ml denatured, sheared 
salmon sperm DNA, followed by washing in e.g. 0.1-0.5x SSC at about 55-65°C for about 5 to 20 min. In general, for 
achieving "high stringency", salt concentrations of 0.01 65 to 0.033 M and a temperature of 5 to 1 0 °C below the melting 

20 temperature (Tm) of the nucleic acid are sufficient. Only sequences with a high degree of identity will hybridize under 
these conditions. 

It is of note that variations in the above conditions may be accomplished through the inclusion and/or substitution of 
alternate blocking reagents. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured salmon 
sperm DNA, and commercially available proprietary formulations. The inclusion of specific blocking reagents may require 

25 modification of the hybridization conditions described above, due to problems with compatibility. Changes in the stringency 
of hybridization are primarily accomplished through the manipulation of formamide concentration (lower percentages of 
formamide result in lowered stringency), salt conditions (lower salt concentrations result in lowered stringency), or 
temperature. Such modifications can generally be effected by the skilled person without further ado. 
As disclosed by Nielsen et al., Science 254:1 497 (1 991 ); and Egholm et al., Nature 365:666 (1 993), PNAs bind specifically 

30 and tightly to complementary DNA strands and are not degraded by nucleases. In fact, PNA binds more strongly to DNA 
than DNA itself does. This is probably because there is no electrostatic repulsion between the two strands, and also the 
polyamide backbone is more flexible. Because of this, PNA/DNA duplexes bind under a wider range of stringency 
conditions than DNA/DNA duplexes, making it easier to perform multiplex hybridization. Smaller probes can be used 
than with DNA due to the strong binding. In addition, it is more likely that single base mismatches can be determined 

35 with PNA/DNA hybridization because a single mismatch in a PNA/DNA 15-mer lowers the melting point (T.sub.m) by 
8°-20° C, vs. 4°-1 6° C for the DNA/DNA 1 5-mer duplex. Also, the absence of charge groups in PNA means that hybrid- 
ization can be done at low ionic strengths and reduce possible interference by salt during the analysis. 
[0012] A hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., membranes, 
filters, chips, pins or glass slides to which, e.g., cells have been fixed). 

Examples for hybridization assays comprise without limitation Northern and Southern blot assays, heteroduplex analysis, 
detection of mutations by sequence specific oligonucleotide hybridization, allele-specific oligonucleotide hybridization 
on DNA chips, assays based on the lllumina's® technology, assays based on the BeadArray® technology, see, for 
example, Barnes et al., Nucleic Acids Res. 33 (2005) 5914-5923; Fan et al., Biotechniques 39 (2005) 583-588; Shen et 

45 al., Mutat. Res. 573 (2005) 70-82; Steemers and Gunderson, Pharmacogenomics, 6 (2005) 777-782. 

[0013] Nucleic acid molecules are "complementary" if they naturally bind to each other under permissive salt and 
temperature conditions by base-pairing. For example, the sequence "A-G-T" binds to the complementary sequence "T- 
C-A". "Complementary" in accordance with the invention, refers to the complete base pairing of nucleotides over the 
entire length of the nucleic acid molecule of the invention. 

50 [0014] As regards the presence of specific amino acids at certain positions of the peptide encoded by the nucleic acid 
molecule of the present invention, these positions can also be assigned to the sequence of said peptide if it is present 
in a longer peptide or protein. More particularly, if the stretch of amino acids homologous or identical to the peptide 
encoded by SEQ ID NO: 1 is identified in a nucleic acid sequence encoding a longer peptide or protein, both sequences 
can be aligned and the positions are assigned. From this information, the positions in the longer peptide or protein 

55 corresponding to the respective amino acid in the peptide encoded by SEQ ID NO: 1 can be retrieved. 

[0015] In accordance with the present invention, the term "percent (%) sequence identity" describes the number of 
matches ("hits") of identical nucleotides/amino acids of two or more aligned nucleic acid or amino acid sequences as 
compared to the number of nucleotides or amino acid residues making up the overall length of the template nucleic acid 
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oramino acid sequences. In other terms, using an alignment, fortwo or more sequences orsubsequences the percentage 
of amino acid residues or nucleotides that are the same (e.g., 70%, 80% or 85% identity) may be determined, when the 
(sub)sequences are compared and aligned for maximum correspondence over a window of comparison, or over a 
designated region as measured using a sequence comparison algorithm as known in the art, or when manually aligned 

5 and visually inspected. This definition also applies to the complement of a test sequence. 

To evaluate the identity level between two nucleotide or protein sequences, they can be aligned electronically using 
suitable computer programs known in the art. Such programs comprise BLAST (Altschul etal., J. Mol. Biol. 1990, 215: 
403), variants thereof such as WU-BLAST (Altschul & Gish, Methods Enzymol. 1996, 266: 460), FASTA (Pearson & 
Lipman, Proc. Natl. Acad. Sci. USA 1988, 85: 2444) or implementations of the Smith -Waterman algorithm (SSEARCH, 

10 Smith & Waterman, J. Mol. Biol. 1981 , 147: 1 95). These programs, in addition to providing a pairwise sequence alignment, 
also report the sequence identity level (usually in percent identity) and the probability forthe occurrence of the alignment 
by chance (P-value). The BLASTN program for nucleic acid sequences uses as default a word length (W) of 11, an 
expectation (E) of 10, M=5, N=4, and a comparison of both strands. For amino acid sequences, the BLASTP program 
uses as default a word length (W) of 3, and an expectation (E) of 10. The BLOSUM62 scoring matrix (Henikoff, Proc. 

is Natl. Acad. Sci. , 1 992, 89: 1 091 5) uses alignments (B) of 50, expectation (E) of 1 0, M=5, N=4, and a comparison of both 
strands. Programs such as CLUSTALW (Thompson Nucl. Acids Res. 2 (1994), 4673-4680) can be used to align more 
than two sequences. In addition, CLUSTALW, unlike e.g. FASTDB, does take sequence gaps into account in its identity 
calculations. 

All of the above programs can be used in accordance with the invention. 
20 in a preferred embodiment of the present invention, the sequence identity to SEQ ID NO: 2 is at least 80%, more 
preferably at least 90% and most preferred at least 95%. 

[0016] Substitutions in the amino acid sequence of the peptide of the present invention are preferably conservative. 
This means that substitutions preferably take place within one class of amino acids. For example, a positively charged 
amino acid is preferably mutated to another positively charged amino acid. The same holds true for the classes of basic, 

25 aromatic or aliphatic amino acids. 

[001 7] Degenerate in accordance with the present invention refers to the degeneracy of the genetic code. Degeneracy 
results because a triplet code designates 20 amino acids and a stop codon. Since four bases exist which are utilized to 
encode genetic information, triplet codons are required to produce at least 21 different codes, i.e. one for each of the 
20 gene-encoded amino acids and one for a stop codon. The possible 4 3 possibilities for bases in triplets give 64 possible 

30 codons, meaning that some degeneracy must exist. As a result, some amino acids are encoded by more than one triplet, 
i.e. by up to six. The degeneracy mostly arises from alterations in the third position in a triplet. Accordingly, nucleic acid 
molecules having a different sequence than that specified above, but still encoding the same peptide lie within the scope 
of the present invention. 

[001 8] In the course of the present invention, it has surprisingly been found that a peptide derived from crotamine but 

35 having a reduced length as compared to a fragment of crotamine consisting of amino acids 27 to 39 (Cro 27 . 39 ^ and 
retaining all cysteines has improved internalization properties as compared to naturally occurring crotamine orCro 2 7_3 9 . 
The C-terminal nuclear localization signal Cro 27 _ 39 (KMDCRWRWKCCKK) proposed as one of two potential sequences 
responsible for membrane permeation (Kerkis et al., 2004) was separately examined for its internalization properties. 
The uptake behavior of this fragment was studied with a fluorophore attached to it. It proved to be an efficient CPP also 
showing cytoplasmic diffusion. Cytoplasmic diffusion is particularly desirable to deliver pharmacologically active sub- 
stances to the cell. This additional feature led the present inventors to introduce changes to this fragment to avoid the 
use of amino acids like methionine, tryptophan, aspartic acid and, in particular, cysteine which makes the synthesis 
more complex and challenging. Avoiding cysteines will not only facilitate the synthesis, handling and storage of the 
peptide but is also suggested to improve the in vivo properties of the peptide since cysteines will likely form intra- and 

45 intermolecular cysteine bridges and therefore promote aggregation of the peptide. The absence or presence of disulphide 
bonds can be important for peptides to maintain their biological activity and conformational stability. However, up to now, 
the role of cysteines or cysteine bridges in CPP has not been examined. International patent application WO03/1 06491 
discloses a method for predicting or designing CPPs. It is of note that the majority of peptides predicted and/or shown 
to exert CPP properties do not contain any cysteine. 

50 Positively charged amino acids (lysine and arginines) were not modified as these are an important feature of a cell 
penetrating peptide. Different derivatives wherein cysteines were substituted with alpha amino butyric acid or serine 
(close analogues of cysteine) were synthesized as well as fragments of the sequence Cro 27 _39 by deleting amino acids 
from the N-terminus. Alternatively, cysteines were deleted one by one or amino acids anywhere in the sequence were 
deleted. Finally, tryptophan was substituted by proline or phenylalanine. All combinations were synthesized containing 

55 a lysine at the N-terminus as a linker for the coupling of FITC (at the e-amino group) for analyzing samples for intracellular 
uptake by fluorescence imaging. 

[0019] The internalization studies on cultured cells show that a change in the amino acid composition dramatically 
affects the intracellular uptake (see Figures 1 to 3 and the examples). Not only the deletion of cysteines one by one in 
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different combinations but also the substitution by a-ami no butyric acid reduced the cellular uptake with the number of 
cysteines deleted or substituted. Substitution of cysteines by serine shows nearly the same results. Since proline-rich 
peptides are known to enhance cell permeability, tryptophan was substituted with proline. However, internalization was 
again immensely affected to the negative. The best results are observed for a fragment K(FITC)-CRWRWKCCKK 

5 (peptide 23 of table 1 below displaying a complete list of all sequences examined for their internalization properties) 
which is competently taken up by cells and also show endosomal as well as cytoplasmic fluorescence distribution. This 
fragment without the linker is three amino acids shorter than the original fragment Cro 27 _ 39 having the sequence KMD- 
CRWRWKCCKK) keeping the number of the cysteines the same. Studies carried out on well known CPPs like Tat, 
Antennapedia and polyarginines revealed that the role of the positive charge is crucial for translocation. Unlike the known 

10 CPPs the CPP of the present invention is markedly different in terms of its function. Efficient cellular uptake and cytosolic 
location along with vesicular distribution at subtoxic concentrations (<2.5 fxM) is the distinguishing feature which is also 
shown by other CPP but at comparatively high concentrations (>10 juuM). Other distinctive features are the influence of 
the chirality of the peptide backbone as well as the sequence order on the cellular uptake: Forms of the proposed peptide 
with the sequence of the CPP of the present invention reversed, the sequence of the CPP of the present invention with 

is D-amino acids, or with D-amino acids and in reversed order also showed lower uptake and cytosolic diffusion unlike 
known for the Tat peptide. 

The entire study shows the significance of each amino acid focusing the requirement of charge and hydrophobic residues 
during membrane permeation. As known from previous studies charged residues help to adhere to the cell surface which 
is the first step of internalization and then tryptophan might aid membrane translocation by membrane destabilization. 
20 [0020] Thus, it has been found that the CPP of the invention actually requires tryptophanes and, surprisingly, cysteines 
in order to optimally exert its internalization properties. Accordingly, an optimized peptide according to the present 
invention is encoded by the nucleic acid sequence depicted in SEQ ID NO: 1 or has the amino acid sequence of SEQ 
ID NO: 2. It encodes three cysteines and two tryptophanes which unexpectedly are shown to improve the internalization 
properties of the peptide. 

25 Being rich in amino acids like methionine, aspartic acid, and cysteine Cro 27 _39 endures difficulties during synthesis and 
storage. Thus it becomes necessary to avoid such amino acids by replacement or deletion as much as possible which 
is the main goal of this study. Herein we showed the development of the proposed fragment from the known one. Both 
the sequences are comparable in terms of function (internalization efficiency and cytosolic distribution) but the proposed 
sequence seems to give the same results being smaller in length and avoiding at least methionine and aspartic acid. 

30 

Table 1 : Sequences derived from Cro 27 _ 39 and examined for their internalization properties in the present invention. 
The individual sequences are discussed in further detail in the appended examples. X: ot-aminobutyric acid; Capitals 



denote L-amino acids, minuscules denote D-amino acids 



1 


K (FITC) - KM DCRWRWKCCKK (Cro 27 . 39 ) 


2 


K (FITC) - KMDXRWRWKCCKK 


3 


K (FITC) - KM DXRWRWKXCKK 


4 


K (FITC) - KM DXRWRWKXXKK 


5 


K (FITC) - KM DXRWRWKCXKK 


6 


K (FITC) - KM DCRWRWKCXKK 


7 


K (FITC) - KM DCRWRWKCSKK 


8 


K (FITC) - KMDCRWRWKSCKK 


9 


K (FITC) - KMDSRWRWKCCKK 


10 


K (FITC) - KMDCRWRWKSSKK 


11 


K (FITC) - KM DSRWRWKSSKK 


12 


K (FITC) - KM DSRWRWKSCKK 


13 


K (FITC) - KMDSRWRWKCSKK 


14 


K (FITC) - KMDCRWRPKCCKK 


15 


K (FITC) - KMDCRPRPKCCKK 


16 


K (FITC) - KMDXRPRPKCCKK 


17 


K (FITC) - KMDXRPRPKXCKK 



6 



EP 2 090 584 A1 



(continued) 





18 


K (FITC) - KMDXRPRPKCXKK 




19 


K (FITC) - KMDCRPRPKXCKK 


5 


20 


K (FITC) - KMDCRPRPKCXKK 




21 


K (FITC) - MDCRWRWKCCKK 




22 


K (FITC) - DCRWRWKCCKK 


in 


23 


K (FITC) - CRWRWKCCKK 




24 


K (FITC) - RWRWKCCKK 




25 


K (FITC) - MDCRWRWKXCKK 


15 


26 


K (FITC) - DCRWRWKXCKK 


27 


K (FITC) - DCRWRWKCXKK 




28 


K (FITC) - CRWRWKXCKK 




29 


K (FITC) - CRWRWKCXKK 


PO 


30 


K (FITC) - RWRWKXCKK 




31 


K (FITC) - MDCRWRWKXXKK 




32 


K (FITC) - DCRWRWKXXKK 


25 


33 


K (FITC) - CRWRWKXXKK 


34 


K (FITC) - RWRWKXXKK 




35 


K (FITC) - KM DCRWRWKCKK 




36 


K (FITC) - KM DCRWRWKKK 


30 


37 


K (FITC) - KM D RWRWKKK 




38 


K (FITC) - KDCRWRWKCCKK 




39 


K (FITC) - KC RWRWKCCKK 


35 


40 


K (FITC) - KRWRWKCCKK 


41 


K (FITC) - CRWRWKCSKK 




42 


K (FITC) - CRWRWKSSKK 




43 


K (FITC) - SRWRWKSSKK 


40 


44 


K (FITC) - SRWRWKCSKK 




45 


K (FITC) - SRWRWKSCKK 




46 


K (FITC) - SRWRWKCCKK 


45 


47 


K (FITC) - CRFRWKCCKK 


48 


K (FITC) - CRWRFKCCKK 




49 


K (FITC) - CRFRFKCCKK 




50 


K (FITC) - KCCKWRWRCK 


50 


51 


k (FITC) - crwrwkcckk 




52 


k (FITC) - kcckwrwrck 




53 


K (FITC) - CrWRWKCCKK 


55 


54 


K (FITC) - CRwRWKCCKK 


55 


K (FITC) - CRWrWKCCKK 




56 


K (FITC) - CRWRwKCCKK 
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(continued) 



57 


K (FITC) - CrwrwKCCKK 


58 


K (FITC) - CRWRWKCGCKK 


59 


K (FITC) - CCRWRWKCCKK 


60 


K (FITC) - CGCRWRWKCGCKK 


61 


K(FITC) -CRWRWKCG 



10 

[0021] In a preferred embodiment, the peptide encoded by the nucleic acid molecule further comprises a linker. 
[0022] A linker as used in connection with the present invention is used to connect the peptide of the invention with 
other moieties. The linker serves to physically separate the peptide of the invention and the other moiety or moieties 
and to ensure that neither the peptide of the invention nor the other moieties are limited in their function due to the close 

15 vicinity to the other. Depending on the other moiety, the linker can be a peptide bond, an amino acid, a peptide of 
appropriate length, or a different molecule providing the desired features. The skilled person knows how to design 
appropriate linker molecules, in particular linker peptides based on his common knowledge. For example, peptide linkers 
can be chosen from the LIP (Loops in Proteins) database (Michalsky et al., 2003). A linker may be appended to the N- 
orthe C-terminus or, if deemed suitable, also to an amino acid apart from the terminal amino acids of the peptide of the 

2 0 present invention. The linker is preferably located at the N-terminus. 

A moiety as used in connection with the present invention is a functional unit. A moiety can e.g. be a linker minimally 
comprising a lysine. Other moieties can be e.g. selectable markers such as FITC or drugs as described in more detail 
below. 

[0023] In a more preferred embodiment, the linker is a lysine. The e-amino group in lysine is suitable to couple the 

25 peptide of the invention to various other moieties. Furthermore, the lysine may serve to further improve the internalization 
properties of the peptide of the invention and may then be supplemented by another linker if deemed suitable. 
[0024] The present invention also relates to a vector comprising the nucleic acid molecule of the present invention. 
Preferably, the vector is a plasmid, cosmid, virus, bacteriophage or another vector used conventionally e.g. in genetic 
engineering. An expression vector according to this invention is capable of directing the replication and the expression 

30 of the nucleic acid molecule of the invention and the peptide or polypeptide encoded thereby. Generally, vectors can 
contain one or more origins of replication (ori) and inheritance systems for cloning or expression, one or more markers 
for selection in the host and one or more expression cassettes. Methods which are well known to those skilled in the art 
can be used to construct and modify recombinant vectors; see, for example, the techniques described in Sambrook and 
Russell, 2001 and Ausubel, 2001 . 

35 [0025] The coding sequences inserted in the vector can e.g. be synthesized by standard methods, or isolated from 
recombinant sources. Ligation of the coding sequences to transcriptional regulatory elements and/or to other amino acid 
encoding sequences can be carried out using established methods. Transcriptional regulatory elements (parts of an 
expression cassette) ensuring expression in prokaryotes or eukaryotic cells are well known to those skilled in the art. 
These elements comprise regulatory sequences ensuring the initiation of the transcription (e.g., translation initiation 

40 codon, promoters, enhancers, and/or insulators), internal ribosomal entry sites (I RES) (Owens etal., 2001 ) and optionally 
poly-A signals ensuring termination of transcription and stabilization of the transcript. Additional regulatory elements 
may include transcriptional as well as translational enhancers, and/or naturally-associated or heterologous promoter 
regions. Preferably, the nucleic acid molecule of the invention is operably linked to such expression control sequences 
allowing expression in prokaryotes or eukaryotic cells e.g. in the form of a vector. The vector may further comprise 

45 nucleotide sequences encoding secretion signals as further regulatory elements. Such sequences are well known to 
the person skilled in the art. Furthermore, depending on the expression system used, leader sequences capable of 
directing the expressed polypeptide to a cellular compartment may be added to the coding sequence of the nucleic acid 
molecule of the invention. Such leader sequences are well known in the art. Specifically-designed vectors allow the 
shuttling of DNA between different hosts, such as between bacteria and fungal cells or bacteria and animal cells. 

50 [0026] The nucleic acid molecules of the invention as described herein above may be designed for direct introduction 
or for introduction via liposomes, phage vectors or viral vectors (e.g. adenoviral, retroviral) into the cell. Additionally, 
baculoviral systems or systems based on Vaccinia Virus or Semliki Forest Virus can be used as vector in eukaryotic 
expression system for the nucleic acid molecules of the invention. Other expression vectors derived from viruses and 
usable for delivery of the polynucleotides or vector into targeted cell populations are retroviruses, vaccinia virus, adeno- 

55 associated virus, herpes viruses, or bovine papilloma virus. 

[0027] A typical origin of replication in mammalian vectors is the SV40 viral ori. Additional elements in mammalian 
vectors might include enhancers, Kozak sequences and intervening sequences flanked by donor and acceptor sites for 
RNA splicing. Highly efficient transcription can be achieved with the early and late promoters from SV40, the long terminal 
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repeats (LTRs)from retroviruses, e.g., RSV, HTLVI, HI VI, and the early promoter of the cytomegalovirus (CMV). However, 
cellular elements can also be used (e.g., the human actin promoter). Other examples for regulatory elements permitting 
expression in eukaryotic host cells are the AOX1 or GAL1 promoter in yeast or the CMV- (Cytomegalovirus), SV40-, 
RSV-promoter (Rous sarcoma virus), chicken beta-actin promoter, CAG-promoter (a combination of chicken beta-actin 

5 promoter and cytomegalovirus immediate-early enhancer), the gai10 promoter, human elongation factor la-promoter, 
CMV enhancer, CaM-kinase promoter, the Autographa californica multiple nuclear polyhedrosis virus (AcMNPV) poly- 
hedral promoter or a globin intron in mammalian and other animal cells. Besides elements which are responsible for the 
initiation of transcription such regulatory elements may also comprise transcription termination signals, such as the 
SV40-poly-A site or the tk-poly-A site or the SV40, lacZ and AcMNPV polyhedral polyadenylation signals, downstream 

10 of the polynucleotide. Suitable selectable markers are dhfr, gpt, G418 neomycin, hygromycin allows the identification 
and isolation of the transfected cells. The transfected nucleic acid can also be amplified to express large amounts of the 
encoded peptide. The dhfr (dihydrofolate reductase) marker is useful to develop cell lines that carry several hundred or 
even several thousand copies of the gene of interest. Another useful selection marker is the enzyme glutamine synthase 
(GS) (Murphy et al. 1991; Bebbington et al. 1992). Using these markers, the mammalian cells are grown in selective 

is medium and the cells with the highest resistance are selected. 

[0028] Suitable origins of replication for prokaryotic cells include, for example, the Col E1 and the M 13 origins of 
replication. Examples of suitable markers for culturing in E. co\\ and other prokaryotes include tetracycline, kanamycin 
or ampicillin resistance genes. Possible regulatory elements permitting expression in prokaryotic host cells comprise, 
e.g., the lac, trp or tac promoter, the lacUV5 or the trp promotor in E. coli. 

20 [0029] The nucleic acid molecule of the present invention may be inserted into several commercially available vectors 
well known to the skilled person who is also able to determine which vectors are suitable for the introduction and/or 
expression of the peptide of the invention. Non-limiting examples include prokaryotic plasmid vectors, such as the pUC- 
series, pBluescript (Stratagene), the pET-series of expression vectors (Novagen) or pCRTOPO (Invitrogen), lambda 
gt1 1 , pJOE, the pBBR1 -MCS series, pJB861 , pBSMuL, pBC2, pUCPKS, pTACTI and vectors compatible with expression 

25 in mammalian cells like pREP (Invitrogen), pCEP4 (Invitrogen), pMC1 neo (Stratagene), pXT1 (Stratagene), pSG5 (Strat- 
agene), EBO-pSV2neo, pBPV-1 , pdBPVMMTneo, pRSVgpt, pRSVneo, pSV2-dhfr, plZD35, Okayama-Berg cDNA ex- 
pression vector pcDV1 (Pharmacia), pRc/CMV, pcDNAI, pcDNA3 (Invitrogene), pSPORTI (GIBCO BRL), pGEMHE 
(Promega), pLXIN, pSIR (Clontech), pIRES-EGFP (Clontech), pEAK-10 (Edge Biosystems) pTriEx-Hygro (Novagen) 
and pCINeo (Promega). Examples for plasmid vectors suitable for Pichia pastoris comprise e.g. the plasmids pA0815, 

30 pPIC9K and pPIC3.5K (all Intvitrogen). 

[0030] The nucleic acid molecule of the present invention referred to above may also be inserted into vectors such 
that a translational fusion with another nucleic acid molecule is generated. The other nucleic acid molecule may encode 
a protein which may e.g. increase the solubility and/or facilitate the purification of the peptide encoded by the nucleic 
acid molecule of the invention. Non-limiting examples include pET32, pET41, pET43. The additional expressible poly- 

35 nucleotide may also encode one or more chaperones to facilitate correct protein folding. Furthermore, the translational 
fusion described above may encode afusion molecule of the invention as will be described elsewhere in this specification. 
[0031] In another embodiment, the present invention relates to a non-human host transfected or transformed with the 
vector of the invention. 

[0032] Non-human hosts according to the invention can be single cells or multi-cellular organisms. 

^o [0033] In a more preferred embodiment, the non-human host is a cell. 

[0034] Suitable prokaryotic host cells comprise e.g. bacteria of the species Escherichia, such as strains derived from 
BL21 (e.g. BL21(DE3), BL21 (DE3)PlysS, BL21 (DE3)RIL, BL21 (DE3)PRARE) or Rosetta®, Streptomyces, Salmonella 
or Bacillus. Suitable eukaryotic host cells are e.g. yeasts such as Saccharomyces cerevisiae or Pichia pastoris or insect 
cells such as Drosophila S2 or Spodoptera Sf9 cells. 

45 Mammalian host cells that could be used include human Hela, HEK293, H9 and Jurkat cells, mouse NIH3T3 and C1 27 
cells, COS 1 , COS 7 and CV1 , quail QC1-3 cells, mouse L cells, Bowes melanoma cells and Chinese hamster ovary 
(CHO) cells. Also within the scope of the present invention are primary mammalian cells or cell lines. Primary cells are 
cells which are directly obtained from an organism. Suitable primary cells are, for example, mouse embryonic fibroblasts 
(MEF), mouse primary hepatocytes, cardiomyocytes and neuronal cells as well as mouse muscle stem cells (satellite 

50 cells) and stable, immortalized cell lines derived therefrom. The recombinant peptide of the invention can be expressed 
in stable cell lines that contain the gene construct integrated into a chromosome or, in stable or transiently transfected 
cells, in the form of a plasmid. 

[0035] Transgenic non-human animals as hosts transfected with, e.g. a gene gun, and/or expressing the nucleic acid 
molecule of the present invention also lie within the scope of the invention. In a preferred embodiment, the transgenic 
55 animal is a mammal, e.g. a hamster, mouse, rat, cow, cat, pig, dog, horse, rabbit or monkey. Transgenic plants as hosts 
transfected with and/or expressing the nucleic acid molecule of the present invention also lie within the scope of the 
present invention. 

[0036] In yet another embodiment, the present invention relates to a method of producing a peptide of the invention 
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comprising culturing the host cell of the invention under suitable conditions and isolating the peptide produced. 
[0037] A large number of suitable methods exist in the art to produce peptides in appropriate hosts. If the host is a 
unicellular organism such as a prokaryote or a mammalian or insect cell, the person skilled in the art can revert to a 
variety of culture conditions. Conveniently, the produced protein is harvested from the culture medium, lysates of the 

5 cultured cells or from isolated (biological) membranes by established techniques. In the case of a multicellular organism 
as a host comprising multiple cells carrying the nucleic acid of the invention, a fraction of these cells may serve as source 
for the peptide of the invention, for example said fraction may be the harvestable part of a plant. A preferred method 
involves the synthesis of nucleic acid sequences c by PCR and its insertion into an expression vector. Subsequently a 
suitable host may be transfected or transformed with the expression vector. Thereafter, in the case that the host is a 

10 cell, the host is cultured to produce the desired peptide, which is isolated and purified. 

[0038] Appropriate culture media and conditions for the above-described host cells are known in the art. For example, 
suitable conditions for culturing bacteria are growing them under aeration in Luria Bertani (LB) medium. To increase the 
yield and the solubility of the expression product, the medium can be buffered or supplemented with suitable additives 
known to enhance or facilitate both. E. coli can be cultured from 4 to about 37 °C, the exact temperature or sequence 

is of temperatures depends on the molecule to be overexpressed. In general, the skilled person is also aware that these 
conditions may have to be adapted to the needs of the host and the requirements of the peptide or protein expressed. 
In case an inducible promoter controls the nucleic acid of the invention in the vector present in the host cell, expression 
of the polypeptide can be induced by addition of an appropriate inducing agent. Suitable expression protocols and 
strategies are known to the skilled person. 

20 [0039] Depending on the cell type and its specific requirements, mammalian cell culture can e.g. be carried out in 
RPMI or DMEM medium containing 10% (v/v) FCS, 2mM L-glutamine and 100 U/ml penicillin/streptomycin. The cells 
can be kept at 37 °C in a 5% C0 2 , water saturated atmosphere. 

Suitable media for insect cell culture is e.g. TNM + 1 0% FCS or SF900 medium. Insect cells are usually grown at 27 °C 
as adhesion or suspension culture. 
25 Suitable expression protocols for eukaryotic cells are well known to the skilled person and can be retrieved e.g. from in 
Sambrook, 2001. 

Such methods are well known in the art (see, e.g., Sambrook et al., supra). 

[0040] An alternative method for producing the peptide of the invention is in vitro translation of mRNA. Suitable cell- 
free expression systems for use in accordance with the present invention include rabbit reticulocyte lysate, wheat germ 
30 extract, canine pancreatic microsomal membranes, E. coil S30 extract, and coupled transcription/translation systems 
such as the TNT-system (Promega). These systems allow the expression of recombinant peptides or proteins upon the 
addition of cloning vectors, DNA fragments, or RNA sequences containing coding regions and appropriate promoter 
elements. 

[0041] Methods of isolation of the peptide produced are well-known in the art and comprise, without limitation, method 
35 steps such as ion exchange chromatography, gel filtration chromatography (size exclusion chromatography), affinity 
chromatography, high pressure liquid chromatography (HPLC), reversed phase HPLC, disc gel electrophoresis or im- 
munoprecipitation (see, for example, Sambrook, 2001). 

[0042] The present invention also relates to a peptide comprising the amino acid sequence encoded by a nucleic acid 
molecule of the present invention or obtainable by the method of the present invention. 

^o [0043] The peptide of the present invention can also be produced synthetically. Chemical synthesis of peptides is well 
known in the art. Solid phase synthesis is commonly used and various commercial synthesizers are available, for example 
automated synthesizers by Applied Biosystems Inc., Foster City, CA; Beckman; MultiSyntech, Bochum, Germany etc. 
Solution phase synthetic methods may also be used, although it is less convenient. By using these standard techniques, 
naturally occurring amino acids may be substituted with unnatural amino acids, particularly D-stereoisomers, and also 

45 with amino acids with side chains having different lengths or functionalities. Functional groups for conjugating to small 
molecules, label moieties, peptides, or proteins or for purposes of forming cyclized peptides may be introduced into the 
molecule during chemical synthesis. In addition, small molecules and label moieties may be attached during the synthetic 
process. Preferably, introduction of the functional groups and conjugation to other molecules minimally affects the 
structure and function of the subject peptide. 

50 The N-and C-terminus may be derivatized using conventional chemical synthetic methods. The peptides of the invention 
may contain an acyl group, such as an acetyl group. 

Methods for acylating, and specifically for acetylating the free amino group at the N-terminus are well known in the art. 
For the C-terminus, the carboxyl group may be modified by esterification with alcohols or amidated to form-CONH 2 or 
CONHR. Methods of esterification and amidation are done using well known techniques. 
55 [0044] Furthermore, the peptide of the invention may also be produced semi-synthetically, for example by a combination 
of recombinant and synthetic production. In the case that fragments of the peptide are produced synthetically, the 
remaining part of the peptide would have to be produced otherwise, e.g. recombinantly, and then linked to the fragment 
to form the peptide of the invention. 
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[0045] The present invention furthermore relates to a fusion molecule comprising the peptide of the invention. 
[0046] A fusion molecule, in the context of the present invention is an at least bipartite molecule comprising the peptide 
of the invention forming one moiety coupled to at least one other moiety, as has been defined above. 
[0047] The fusion molecule of the present invention can be produced and isolated according to the methods described 
5 above for the production of the peptide of the invention. 

[0048] In a preferred embodiment, the peptide is fused to a nucleic acid, a peptide or a polypeptide, an aptamer, a 
small molecule, a nanoparticle or nanocarrier or a contrast agent. 

The nucleic acid can be linear or circular, e.g. in the form of a plasmid, an antisense RNA or siRNA. In this regard, the 
above described mucleic acid mimicking molecules are also suitable as moieties. Aptamers are DNA or RNA molecules 
10 that have been selected from random pools based on their ability to bind other molecules. Aptamers include those which 
bind nucleic acid, proteins, small organic compounds, and even entire organisms such as unicellular organisms. A 
database of aptamers is maintained at http://aptamer.icmb.utexas.edu/. 

More specifically, aptamers can be classified as DNA or RNA aptamers or peptide aptamers. Whereas the former consist 
of (usually short) strands of oligonucleotides, the latter consist of a short variable peptide domain, attached at both ends 
is to a protein scaffold. 

[0049] Nanoparticles (or nanopowders or nanoclusters or nanocrystals or nanocarriers) are microscopic particles with 
at least one dimension less than 100 nm. Nanocarriers, usually having a diameter of about 50 to 500 nm, are able to 
house smaller molecules, especially drugs which can then be delivered at the desired site. Nanocarriers existing to date 
are temperature and/or pH sensitive, which means that they can release their cargo upon heating or a change in the 
20 pH. They are mostly structurally stable in the normal physiological environment and resistant to e.g. intravenous admin- 
istration. Polymeric core-shell nanocarriers are small in size (generally less than 200 nm), with shells that protect enclosed 
bioactive compounds against degradation and digestive fluids. 

[0050] In a preferred embodiment of the fusion molecule of the present invention, the peptide of the invention is fused 
to a reporter unit. Suitable reporter units are well-known to the skilled person and comprise fluorescent dyes such as 
25 FITC orTAMRA or reporter units for MR I or PET such as Gd-DOTA-, Gd-DTPA-, 64 Cu-DOTA- or 68 Ga-DOTA, or nucleic 
acids such as si RNA, antisense oligonucleotides or nucleic acids encoding reporter genes, preferably operably linked 
to a regulatory element, such as a promoter, or contrast agents for other imaging techniques. 

[0051] In another preferred embodiment of the fusion molecule of the present invention, the peptide of the invention 
is fused to a pharmacologically active compound. Pharmacologically active compounds may belong to different substance 
30 classes such as e.g. nucleic acids, peptides, small molecules, nanoparticles etc. as listed above. 

[0052] In case the peptide of the invention is fused to more than one other moiety, suitable combinations of moieties 
can be e.g. a linker moiety and a pharmacologically active moiety, or a detectable moiety and a pharmacologically active 
moiety, optionally further comprising one or more linker moieties. 

[0053] In a preferred embodiment, the peptide or fusion molecule is applied for research purposes. 
35 The peptide or fusion molecule of the present invention can be applied e.g. in optical imaging methods, magnetic 
resonance imaging (MRI), primarily used in medical imaging to demonstrate pathological or other physiological alterations 
of living tissues, or positron emission tomography (PET), a nuclear medicine medical imaging technique which produces 
a three-dimensional image or map of functional processes in the body. 

In MRI, a contrast agent may be applied to improve the measurement. The agent may be as simple as water, taken 
orally, e.g. for imaging the stomach and small bowel although substances with specific magnetic properties may also 
be used. Most commonly, a paramagnetic contrast agent (usually a gadolinium compound) is given. This provides high 
sensitivity for e.g. the detection of vasculartissues (e.g. tumors) and permits assessment of brain perfusion (e.g. in stroke). 
More recently, superparamagnetic contrast agents (e.g. iron oxide nanoparticles) have become available. These may 
be used for liver imaging - normal liver tissue retains the agent, but abnormal areas (e.g. scars, tumors) do not. They 
45 can also be taken orally, to improve visualisation of the gastrointestinal tract, and to prevent water in the gastrointestinal 
tract from obscuring other organs (e.g. pancreas). 

Diamagnetic agents such as barium sulfate have been studied for potential use in the gastrointestinal tract, but are less 
frequently used. 

To conduct a PET scan, a short-lived radioactive tracer isotope, which decays by emitting a positron and which also has 
50 been chemically incorporated into a metabolically active molecule, is injected into the living subject (usually into the 
blood circulation). During a waiting period, the metabolically active molecule becomes concentrated in the tissues of 
interest; then the research subject or patient is placed in the imaging scanner. The molecule most commonly used for 
this purpose is fluorodeoxyglucose (FDG), a sugar, for which the waiting period is typically an hour. 
The peptide of the present invention may serve as shuttle in order to target the respective contrast agent coupled thereto 
55 to the site of interest in both methods. The intracellular delivery should provide a higher sensitivity as well as specificity 
as compared to known commercially available contrast agents. Aside of the more specific detection of targeted cells 
(e.g. in cancer diagnosis or the detection of loss of (3-cells in diabetes) such contrast agents can be used for non-invasive 
tracking of the fate and action of transplanted cells in cellular therapies (e.g. cancer treatment with cytotoxic T-cells). 
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Cell based therapies such as stem cell therapies or adoptive immunotherapies were already successfully tested and 
also monitored in animal models, mainly by invasive optical imaging methods (e.g. Sauer MG, 2004). However, the 
clinical application still suffers f rom the lackof a non-invasive diagnostic method for a long-term quantitative and qualitative 
evaluation of the transplanted cells. This is essential for the monitoring of the treatment and its efficacy. By a non-toxic 
5 labeling procedure (most preferable repeatedly applicable) with specifically targeted contrast agents cells can be followed 
in vivo, and their accumulation as well as function can be monitored using imaging techniques like MRI, nuclear or optical 
imaging. 

[0054] The present invention furthermore relates to a pharmaceutical composition comprising the peptide of the in- 
vention or the fusion molecule of invention. 
10 [0055] The term "composition", as used in accordance with the present invention, relates to a composition which 
comprises at least one of the recited compounds. The composition may be in solid or liquid form and may be, inter alia, 
in the form of (a) powder(s), (a) tablet(s), (a) solution(s) or (an) aerosol(s). 

[0056] In accordance with the present invention, the term "pharmaceutical composition" relates to a composition for 
administration to a patient, preferably a human patient. The pharmaceutical composition of the invention comprises the 

is compounds recited above, alone or in combination. It may, optionally, comprise further molecules capable of altering 
the characteristics of the compounds of the invention thereby, for example, stabilizing, modulating and/or activating their 
function. The pharmaceutical composition of the present invention may, optionally and additionally, comprise a phar- 
maceutical^ acceptable carrier. By "pharmaceutically acceptable carrier" is meant a non-toxic solid, semisolid or liquid 
filler, diluent, encapsulating material or formulation auxiliary of any type. Examples of suitable pharmaceutical carriers 

20 are well known in the art and include phosphate buffered saline solutions, water, emulsions, such as oil/water emulsions, 
various types of wetting agents, sterile solutions, organic solvents including DMSO etc. Compositions comprising such 
carriers can be formulated by well known methods. Generally, the formulations are prepared by contacting the compo- 
nents of the pharmaceutical composition uniformly and intimately with liquid carriers or finely divided solid carriers or 
both. Then, if necessary, the product is shaped into the desired formulation. 

25 [0057] In a preferred embodiment, the composition is suitable for parenteral administration. The term "parenteral" as 
used herein refers to modes of administration, which include intravenous, intramuscular, intraperitoneal, intrasternal, 
subcutaneous and intraarticular injection and infusion. These pharmaceutical compositions will be administered to the 
subject at a suitable dose. The dosage regimen will be determined by the attending physician and clinical factors. As is 
well known in the medical arts, dosages for any one patient depends upon many factors, including the patient's size, 

30 body surface area, age, the particular compound to be administered, sex, time and route of administration, general 
health, and other drugs being administered concurrently. The therapeutically effective amount for a given situation will 
readily be determined by routine experimentation and is within the skills and judgement of the ordinary clinician or 
physician. Generally, the regimen as a regular administration of the pharmaceutical composition should be in the range 
of 1 |xg to 5 g units per day. However, a more preferred dosage might be in the range of 0.01 mg to 1 00 mg, even more 

35 preferably 0.01 mg to 50 mg and most preferably 0.01 mg to 10 mg per day. 

[0058] The components of the pharmaceutical composition to be used for therapeutic administration must be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron membranes). 
[0059] The components of the pharmaceutical composition ordinarily will be stored in unit or multi-dose containers, 
for example, sealed ampoules or vials, as an aqueous solution or as a lyophilized formulation for reconstitution. As an 

^o example of a lyophilized formulation, 1 0-ml vials are filled with 5 ml of sterile -filtered 1 % (w/v) aqueous solution, and the 
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the lyophilized compound(s) using 
bacteriostatic Water-f or- Injection. 

Preservatives and other additives may also be present such as, for example, antimicrobials, anti oxidants, chelating 
agents, and inert gases and the like. Furthermore, the pharmaceutical composition may comprise further agents de- 

45 pending on its intended use. 

[0060] The pharmaceutical composition may be particularly useful for the treatment and/or prevention of diseases, 
including for example cancer, enzyme deficiency diseases, infarcts or cerebral ischemia, diabetes, inflammatory dis- 
eases, infections such as bacterial, viral orf ungal infections, autoimmune diseases such as systemic lupus erythematodes 
(SLE), multiple sclerosis or rheumatoid arthritis, diseases with amyloid-like fibrils such as Alzheimer's disease (AD) and 

50 Parkinson's disease (PD) or certain forms of myopathy. Further preferred diseases are those where the pharmaceutical 
composition has to cross restrictive membrane structures such as the blood-brain barrier in order to be effective. Examples 
of such diseases are AD or PD. 

[0061] Cancer, in accordance with the present invention refers to a class of diseases or disorders characterized by 
uncontrolled division of cells and the ability of these to spread, either by direct growth into adjacent tissue through 
55 invasion, or by implantation into distant sites by metastasis (where cancer cells are transported through the bloodstream 
or lymphatic system). 

[0062] Enzyme deficiencies leading to disorders are caused by one or more mutations in one or more enzymes in a 
cell including fatty acid oxidation disorders, urea cycle disorders, phenylketonuria, glycogen branching enzyme deficiency 
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and mitochondrial enzyme deficiencies. 

[0063] Bacterial infections, in accordance with the present invention include but are not limited to bacterial meningitis, 
cholera, diphtheria, listeriosis, pertussis (whooping cough), pneumococcal pneumonia, salmonellosis, tetanus, typhus 
or urinary tract infections. 

5 [0064] Viral infections, in accordance with the present invention include but are not limited to mononucleosis, AIDS, 
chickenpox, common cold, cytomegalovirus infection, dengue fever, Ebola hemorrhagic fever, hand-foot and mouth 
disease, hepatitis, influenza, mumps, poliomyelitis, rabies, smallpox, viral encephalitis, viral gastroenteritis, viral en- 
cephalitis, viral meningitis, viral pneumonia or yellow fever. 

[0065] Fungal infections in accordance with the present invention include but are not limited to aspergillosis, blasto- 
ff mycosis, candidiasis, coccidioidomycosis, cryptococcosis, histoplasmosis or tinea pedis. 

[0066] Autoimmune diseases, in accordance with the present invention referto diseases which arise from an overactive 
immune response of the body against substances and tissues normally present in the body. Autoimmune diseases are 
well known to the person skilled in the art and include, but are not limited to Lupus erythematosus, acute disseminated 
encephalomyelitis, aplastic anemia, autoimmune hepatitis, diabetes mellitus, multiple sclerosis, optic neuritis or rheu- 
is matoid arthritis. 

[0067] Diseases with amyloid-like fibrils in accordance with the present invention are diseases which share as a 
common feature that the normally soluble peptide amyloid-beta or the protein alpha-synuclein aggregates into an ordered 
fibrillar structure typically resulting in increased oxidative injury, excitotoxicity and altered cell cycle. Diseases with 
amyloid-like fibrils include but are not limited to Alzheimer's disease (AD) and Parkinson's disease (PD). 
20 [0068] Alzheimer's disease is a neurodegenerative disease characterized by progressive cognitive deterioration to- 
getherwith declining activities of daily living and neuropsychiatric symptoms or behavioral changes. It is the most common 
type of dementia. 

[0069] Parkinson's disease is a degenerative disorder of the central nervous system that often impairs the sufferer's 
motor skills and speech. 

25 [0070] Myopathies are neuromuscular diseases in which the muscle fibers do not function, resulting in muscular 
weakness. Several classes of myopathy are known and include but are not limited to for example muscular dystrophies, 
congenital myopathies, mitochondrial myopathies or inflammatory myopathies. 

[0071] In a preferred embodiment of the present invention, the pharmaceutical composition is used as a vaccine to 
prevent, inter alia, bacterial, viral or fungal infections as described above. In a particularly preferred embodiment, the 

30 vaccine is a DNA vaccine. Accordingly, the peptide of the invention is coupled to a nucleic acid which, when introduced 
into a cell, will be translated and the resulting peptide or protein will be processed in order to be presented in the form 
of antigenic fragments on the surface of cells, wherein the latter are usually found in bacteria, fungi or viruses. 
Alternatively, the composition of the present invention comprising the peptide of the invention coupled to another moiety, 
such as a peptide or protein, also directly provides the cell with the expressed protein/peptide or the molecule against 

35 which immunity is to be induced when introduced into a cell without the need for it to be translated. 

[0072] In the treatment of the above disorders and diseases, at least one pharmacologically active moiety is to be 
coupled to the peptide of the invention. As described above for moieties in general, pharmacologically active moieties 
may belong to different substance classes. 

[0073] In addition to a pharmacologically active moiety, the fusion molecule of the invention may further comprise a 
^o targeting sequence or moiety specifically targeting to the desired tissue or structure as described further below. 

[0074] In addition, the present invention relates to a diagnostic composition comprising at least one of (a) the nucleic 
acid molecule of the invention, (b) the vector of the invention, (c) the peptide of the invention, or (d) the fusion molecule 
of the invention. 

[0075] In accordance with the present invention, the term "diagnostic composition" relates to compositions for diag- 
45 nosing individual patients for their potential response to or curability by the pharmaceutical compositions of the invention. 
Furthermore, a diagnostic composition can denote a substance comprising the peptide of the invention fused to at least 
one more moiety for research purposes as has already been described above for the peptide and the fusion molecule 
of the invention. The diagnostic composition may further comprise appropriate buffer(s). The diagnostic composition 
may be packaged in a container or a plurality of containers. 
50 [0076] Preferred embodiments of the diagnostic composition comprise the peptide of the present invention fused to 
a reporter unit and a target sequence specifically targeting to the desired tissue or structure. For example, the target 
sequence can be an antisense oligonucleotide or a peptide sequence which is enzymatically cleavable. Such target 
sequences can be designed in order to deliver the reporter units to e.g. cancerous cells or tissues or otherwise diseased 
cells or tissues. 

55 Using the diagnostic composition of the present invention, diseases such as cancer can be detected in an early stage 
thus enabling for an early and more promising therapy. In particular, the pH value in cancerous tissue is usually more 
acidic than in non-cancerous tissue. Accordingly, a reporter unit sensitive to changes in the pH value and thus capable 
of detecting tissue with an e.g. more acidic pH value, in connection with the diagnostic agent of the present invention 



13 



EP 2 090 584 A1 



may serve in the detection of cancer. 

Another example refers to the detection of insulin mRNA in insulin producing (3-cells by coupling the peptide of the 
present invention to an antisense nucleic acid binding to insulin mRNA. With this molecule as a diagnostic tool, the 
insulin production in (3-cells can be monitored and the decrease of insulin production and/or loss of p-cells which may 

5 indicate an early stage of diabetes can be detected. 

Yet a different example refers to the detection of transplanted, healthy cells in the body after a transplantation event to 
monitor the tolerance of the body towards the graft as well as the fate of transplanted cells in the body. 
[0077] The above-described combinations of moities usable in the detection and/or diagnosis with therapeutically 
applicable moieties become more and more important in the context of theranostics. This is the term used to describe 

10 the proposed process of diagnostic therapy for individual patients, to test them for possible reaction to taking a new 
medication and to tailor a treatment for them based on the test results. 

[0078] The present invention furthermore relates to a method of detecting the internalization behaviour of a fusion 
molecule comprising the peptide of the invention or a fusion molecule according to the invention, comprising (a) admin- 
istering said fusion molecule to a cell and (b) detecting the internalization of the fusion molecule. 

is [0079] The present method is particularly suitable to estimate the applicability of the peptide of the present invention 
in connection with other moieties coupled thereto for medical or research purposes, e.g. in the form of pharmaceutical 
or diagnostic compositions. If efficient internalization of the peptide or the fusion molecule comprising more than one 
moieties is detected and, optionally, its localization is in the cytoplasm, this indicates that the respective molecule can 
be efficiently used for medical and research purposes. 

20 [0080] In addition, the present invention relates to a method of treating, preventing or diagnosing a condition selected 
from cancer, enzyme deficiency diseases, infarcts, cerebral ischemia, diabetes, inflammatory diseases, infections such 
as bacterial, viral orfungal infections, autoimmune diseases such as systemic lupus erythematodes (SLE) or rheumatoid 
arthritis, diseases with amyloid-like fibrils such as Alzheimer's disease (AD) and Parkinson's disease (PD) or certain 
forms of myopathy comprising administering the composition of the invention to a subject in need thereof. 

25 [0081] The present invention also relates to the peptide, the fusion molecule or the composition of the invention for 
therapeutic or diagnostic purposes. In a preferred embodiment the therapeutic purpose is the treatment and/or prevention 
of cancer, enzyme deficiency diseases, infarcts, cerebral ischemia, diabetes, inflammatory diseases, infections such as 
bacterial, viral or fungal infections, autoimmune diseases such as systemic lupus erythematodes (SLE) or rheumatoid 
arthritis, diseases with amyloid-like fibrils such as Alzheimer's disease (AD) and Parkinson's disease (PD) or certain 

30 forms of myopathy. 

[0082] The present invention furthermore relates to a kit comprising at least one of (a) the nucleic acid molecule of 
the invention, (b) the vector of comprising the nucleic acid molecule of the invention, (c) the host cell comprising the 
vector of the invention, (d) the (poly)peptide of the invention or (e) the fusion protein of the invention. 
[0083] The various components of the kit may be packaged in one or more containers such as one or more vials. The 
35 vials may, in addition to the components, comprise preservatives or buffers for storage. 
[0084] The figures show: 

Figure 1 : Comparison of the intracellularfluorescence of the synthesized peptide fragments (cargo: K(FITC), peptide 
23 = SEQ ID NO: 2). corr. f.u. = corrected fluorescence units (measured cell-related FITC fluorescence corrected 
^o for the number of cells evaluated by Bisbenzimid 33342 fluorescence. 

Figure 2: Influence of substitution of cysteines by serine in peptide 23 (SEQ ID NO: 2) on the internalization efficiency, 
ns, not significant; *, p<0.05, ***, p<0.001 significantly different compared to peptide 23 (ANOVA, Bonferroni's 
Multiple Comparison Test). K(FITC) was coupled to all peptides at the N-terminus. 

45 

Figure 3: Influence of sequence order and amino acid chirality on the internalization efficiency. *, p<0.05, **, p<0.01 , 
***, p<0.001 significantly different compared to peptide 23 (ANOVA, Bonferroni's Multiple Comparison Test). K 
(FITC) or k (FITC) was coupled to all peptides at the N-terminus. 

50 [0085] The invention will now be described by reference to the following examples which are merely illustrative and 
are not to be construed as a limitation of scope of the present invention. 

Example 1: Peptide Synthesis: materials and method 

55 [0086] All solvents were of peptide synthesis grade. N,N-dimethylformamide (DMF), dichloromethane (DCM), trifluor- 
oacetic acid (TFA), fluorescein isothiocyanate (FITC) and methanol were purchased from Acros Organics, Belgium. 
Protected Fmoc amino acid as well as resin was obtained from Novabiochem (Nottingham, UK). The side chain of lysine 
was protected by Boc or Dde, Cys by Trt ortBu, Arg by Pbf and, Trp by Boc. 
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(A) Automatic Peptide Synthesis: 

[0087] Peptides were prepared by fully automated solid-phase peptide synthesis using Fmoc/tBu -strategy and a- 
Fmoc-(e-BOC)-lysine-TCP-polystyrene resin. The resin was distributed in 30 mg aliquots (15 |j,mol) to filter tubes, which 

5 were positioned in the format of a microtiter plate on valve blocks. Fmoc deprotection were carried out two times, 10 
min. each, with 30 % piperidine in dimethylformamide (DMF) (300 jxl). Nine washing steps were done with DMF (300 
jxl). Fmoc-amino acids (0.5 M) were dissolved with 1 -hydroxybenzotriazole (HOBt) (0.5 M) in DMF. Diisopropylethylamine 
[3 M in NMP, 60 |xl] was added to the reaction vessels. Coupling reagents diisopropylcarbodiimide [3 M in DMF , 50 juU, 
first coupling] or 1 -H-benzotriazolium-1-[bis(dimethylamino)methylene]-5-chloro-tetrafluoroborate (1-), 3-oxide [TCTU, 

10 0.5 M in DMF, 200 pj, second coupling] and Fmoc-amino acids [first coupling 200 fxl; second coupling 100 mJ] were 
distributed to the reaction vessels. After 1 h coupling time, coupling reagents were filtered off and the resins were washed 
with DMF [1 x 200 juil] followed by a second coupling step (1 h). Amino acids were introduced using a sevenfold molar 
excess of the respective Fmoc-L-amino acid. After washing steps [4 x 400 \x\] Fmoc deprotection was carried out two 
times, 10 min. each, with 30 % piperidine in DMF (300 |xl). Nine washing steps were done with DMF (300 |xl). The 

is coupling process was repeated according to the length of the peptide. The e-group of N-terminal lysine was labeled 
manually with FITC (4 fold excess) with triethylamine (1 :2) in DMF overnight. Peptides were washed with DMF, DCM, 
methanol (4x each) and dried before cleaving from the resin. The peptides were cleaved off the resin and side-chain 
deprotected with Reagent K (500 |jj). 

20 (B) Manual Peptide Synthesis: 

[0088] The synthesis of selected peptides was carried out by solid phase Fmoc chemistry using a manual multiple 
peptide synthesizer (Heidolph Synthesis 1 synthesizer). Peptides were synthesized on <x-Fmoc-(e-BOC)-lysine-2-chlo- 
rotrityl resin (0.83 mmol/g). Fmoc protected amino acids (4 fold excess) were coupled with DIC/ HOBT activation for 60 
25 min. Fmoc was removed by 30% piperidine in DMF (2x10 min). Resin was washed after each cycle of coupling and 
deprotection with DMF (4x). Completeness of coupling and deprotection was monitored by Kaiser Test (38). Peptides 
were washed with DMF, DCM, methanol (4x each) and dried. The peptides were cleaved off the resin and side-chain 
deprotected with Reagent K (500 juul) . 

This cleavage reaction was optimized further on, in particular for the synthesis of DOTA based compounds, by using 
30 TFA: Reagent K:TMSBr (8. 5: 1.45:0. 05) for 2 hr instead of Reagent K alone. Forthe oxidized peptides a different cleavage 
cocktail consisting of TFA:TIPS:H 2 0 was used for 1 h followed by addition of DMSO (5:1 ) at 0°C for 30 min and stirring 
for 1 h at RT. 

Example 2: Purification and characterization: 

35 

[0089] All peptides were precipitated with MTBE. Precipitates were collected by centrifugation and resuspended in 
cold MTBE 2 times and were lyophilized. Samples were purified by semipreparative RP-HPLC at RT using water/0.1% 
TFA (solvent A) and Acetonitrile/0.1% TFA (solvent B) on a Varian Polaris C18-Ether column (21.2 mm in diameter, 
length of 250 mm, particle size 5 |mm). The product containing fractions were defined by investigation with analytical RP- 
HPLC on a Varian Polaris C18-Ether column (4.6mm in diameter, length of 250 mm, particle size 5 |iim). Mass spec- 
trometry (ESI-MS) was used forfurther characterization. The purified product was dissolved in water and tert. butyl alcohol 
(1 :4) with 2% acetic acid, and lyophilised. 

Peptides containing cysteine are proneto oxidative formation of disulphide bonds, which could be formed intramolecularly, 
resulting in a cyclic peptide, or intermolecularly, forming oligomers or aggregates. 
45 As this CPP is a cysteine rich peptide, care has to be taken during synthesis and storage of the peptides. During the 
synthesis of peptide, choice of protecting groups and proper scavengers during cleavage should be optimized. After 
lyophilization peptides were stored under nitrogen. Resolved samples for internalization studies were aliquoted and 
stored at -80°C. Solutions for cell studies were freshly prepared for each experiment from these aliquots. 

50 Example 3: SAR studies 

[0090] Various fragments derived from the proposed fragment (Cro 27 _ 39 ) were synthesized to achieve an optimal 
fragment maintaining the distinctive features of the parent sequence. The original fragment Cro 27 _ 39 (peptide 1, see 
table 1) is a peptide of 13 amino acid including synthetically difficult amino acids like methionine or cysteine. The aim 
55 of this study was to simplify the sequence maintaining the distinctive features of the parent sequence Cro 27 .3 9 . To 
visualize this aim series of peptides were synthesized by substituting or deleting amino acid residues but preserving the 
basic amino acids in the sequence. K (FITC) or k (FITC) was coupled to all peptides at the N-terminus to measure the 
internalization efficiency. 
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(A) Substitution of cysteine: 

[0091] The purpose of substitution was to understand the importance of cysteine in the sequence. Replacement of 
cysteines one by one or all together by aminobutyric acid or serine will help in estimating the number and position of 
5 cysteines responsible for maintaining penetration ability and to minimize the complexity of the synthesis by avoiding 
cysteines. 

(B) Deletion of cysteines: 

w [0092] The same rationale was valid studying the possibilities of avoiding cysteines completely. In the way of doing 
so we tried to delete cysteines from the sequence one by one and studying the effect of the deletion on the internalization 
efficiency. 

(C) Shortening the sequence: 

15 

[0093] To find the minimal amino acid composition necessary for the best uptake, combinations were designed by 
deleting amino acids from the N-terminus keeping the positively charged amino acids the same. On the other hand, 
deletion of the positively charged amino acids from C-terminus was also studied. 

20 (D) Shortening the sequence and substitution: 

[0094] Other variations include combined substitution and deletion to study the combined effect of the two. This 
included substitution of the cysteines by amino butyric acid and deletion from N-terminal amino acids one by one (until 
reaching the first cysteine). The idea was to check the appropriate length and the number of cysteine residues needed. 

25 

(E) Substitution of tryptophan: 

[0095] Tryptophans are known to be prone to side reactions like alkylation. Also blends of tryptophans and cysteines 
are not the most favorable ones as there are more chances of side reactions during the course of disulphide formation 
30 by various methods chemically. Therefore tryptophan was substituted by proline or phenylalanine one by one or com- 
pletely. 

(F) Effect of chirafity: 

35 [0096] Screening of a number of fragments for cellular uptake led to a peptide shorter in length (peptide 23, SEQ ID 
NO: 2). Studies were carried on this peptide in order to check factors other than the amino acid composition itself. 
Other distinctive features are the influence of the chirality of the peptide backbone as well as the sequence order on the 
cellular uptake. Peptides with d-isomers (whole sequence with D-amino acids), L-amino acids in reversed order and 
with D-amino acids in reversed order were synthesized. In addition, the effect of stereochemistry was studied only in 

^o the region of Trp-Arg-Trp-Arg by using single D-amino acids or exchanging all four residues. This pattern seemed to be 
important as indicated by the results of the initial screening. 

(G) introduction of a spacer unit: 

45 [0097] Studies show the importance of cysteines in the proposed fragment and their oxidation status. To extend our 
study in this direction attempts were made focusing on oxidation of the proposed fragment. In SEQ ID NO: 2 two cysteines 
are adjacent to each other. Investigations were carried out to estimate the favorable conditions for disulphide bond 
formation between them. The purpose of introduction of the spacer glycine was not only to study the necessity of an 
optimal distance between two cysteines for disulphide linkage but also to facilitate the synthesis of defined disulphide 

50 bonds through fragment condensation. Additionally, the effect of elongating the sequence by introducing an additional 
cysteine was studied for the convenient synthesis of peptides with two defined disulphide linkages. 
[0098] Results of the experiments are depicted in Figures 1 to 3. 

Example 4: Effect of cargos on uptake 

55 

[0099] CPPs are known to be the carrier of various cargos through the plasma membrane. Cellular uptake of various 
cargos was studied. Cargos were linked to the peptide through a linker which can be varied according to the need. In 
this study lysine is used as a linker. Choice of the amino protecting groups depends on the requirement of the synthesis. 



16 



EP 2 090 584 A1 



Fluorescent dye as a cargo: 
[0100] 

5 >> Coupling of FITC to e-amino group: 

For structure activity studies all peptides were coupled to FITC through a Boc-Lysine (Fmoc)-OH as a linker. 
The e-group of N-terminal lysine was labeled with FITC (4 fold excess) with triethylamine (1 :2) in DM F overnight. 
Analysis and purification conditions are same as before (C). 

10 

^ Coupling of TAMRA and carboxy fluorescein: 

Different dyes were coupled at e-amino group to study the effect of cargo on internalization. Different colored 
dyes were synthesized for the chasing experiments to study the mechanism of internalization. TAMRA was 
is coupled both at a-amino and e-amino group to study the effect of the position of the cargo on internalization. 

Coupling of these dyes were done by DIC, HOBt activation for 8h. 
Analysis and purification conditions are same as before (C). 

MR reporter as cargo: 

20 

[0101] 

>- Gd(DOTA) as cargo: 

25 To extend the variations in the range of the cargos an attempt was made forthe synthesis of bimodal intracellular 

contrast agents using proposed peptide as a carrier of Gd (DOTA) [as MR reporter] and FITC [fluorescent 
marker]. 

Synthesis of the peptide was done as mentioned before. Coupling of the cargo was carried out by coupling of 
Fmoc-Lys (Dde)-OH as the linker. DOTA was coupled by DIC, HOBt activation for 12 h at a-amino position 
30 (after Fmoc deprotection). Later FITC was coupled at e-amino after Dde deprotection by 2% hydrazine in DMF 

3min X 2. Analysis and purification conditions are same as before (C). 

The actual concentration of the peptide was estimated by fluorescence spectroscopy to be - 25%. Accordingly 
GdCI 3 .6H 2 0 was added (1 :1 ) at pH 6 50°C for 1 8h. Repurification was done to get rid of the excess gadolinium. 
Analysis and purification conditions are same as before (C). Samples were dialyzed for 48h to remove excess 
35 Gadolinium and other inorganic salts. 

Example 5: Cell Culture 

[0102] NIH 3T3 mouse fibroblast cells were cultured as a monolayer at 37°C with 1 0% C0 2 in Dulbecco's Modified 
^o Eagle's Medium (DMEM) supplemented with 1 0% fetal bovine serum, 4 mM L-glutamine, 1 00 img/mL streptomycin and 
100 U/mL penicillin (all purchased from Biochrom AG, Germany). Cells were passaged by trypsinization with trypsin/ 
EDTA 0.05/0.02% (w/v) in phosphate-buffered saline (PBS; Biochrom AG, Germany) every second to third day. 

Example 6: Determination of concentrations of f luorescently labeled peptides 

45 

[0103] Peptides were dissolved in MilliQ water to obtain a 10mM solution by weight. For the determination of the exact 
peptide concentration, these stock solutions were diluted 1 :1 00 in DMEM. The absorbance of the solutions was measured 
by multiplate reader (BMG Labtech, Germany) at 485nm with ratiometric correction of turbidity at 690nm. The concen- 
trations of the stock solutions were calculated assuming e carboxyf | UOrescein 485 nm =81 ,000 l/(mohcm). The concentrations 
50 of TAM RA labeled peptides was calculated assuming e TAMRA 540 nm =40,000 I/ (mol-cm). 

Example 7: Uptake Assay 

[0104] Internalization experiments on cells were performed in 96 well microplates by inoculation of NIH 3T3 fibroblasts 
55 (1 x 1 0 4 cells/well). After 24 h, cells were incubated with 2.5 |xM of different peptide solutions in DMEM for additional 
18 h at 37 °C with 10% C0 2 . Before washing, cells were incubated with Bisbenzimid 33342 (Hoechst 33342), a nuclear 
stain, in order to estimate the cell number. Cells were washed with Hanks' buffered saline (Biochrom AG, Germany) 
and extracellular fluorescence was quenched by incubating with cold trypan blue 0.05% (w/v) in phosphate-buffered 
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saline (Biochrom AG, Germany) for 3 min followed by repeated washes with Hanks' buffered saline. Cell-related FITC 
fluorescence (Ex 485 nm/Em 530 nm) and cell number (Ex 346nm/ Em 460nm) was evaluated in a multiplate reader. 
Experiments were run at least three times for each peptide with six replicates. Statistical analysis was performed by 
Student's f-test or ANOVAwith Dunnett's post test. P values < 0.05 were considered significant. The results are depicted 
5 in Figures 1 to 3. 

Example 8: Microscopy 

[0105] The same cells were subjected to microscopic studies without fixation using a Zeiss Axiovert 200 M (Germany) 
10 microscope with a LD Plan NeoFluor 40X objective. The imaging conditions were kept constant for the observation of 
different samples. Cellular localization and distribution of the peptide was observed by irradiating with blue light (470/40 
nm) and observing at 525/50 nm. The bright punctate and encapsulated appearing FITC fluorescence was categorized 
as vesicular uptake while diffused fluorescence appeared to be distributed in the entire cell with similar intensity. Manual 
observation of at least three experiments was compiled to conclude if the peptide shows diffused, vesicular or both types 
is of uptake. Apart from FITC fluorescence, the nuclear labeling by Hoechst was observed at 460/50 nm and trypan blue 
fluorescence viewed at 645/75 nm. Also phase contrast images with DIC of the same area were made to observe if the 
cells maintain their normal morphology in the presence of peptides. 

Example 9: Time Course 

20 

[0106] Cells were seeded at a density of 1 x 10 4 cells/well in 96 well microplates. After 48 h, cells were incubated 
with 2.5 (xM of peptide solution in DM EM for different time points (30 min - 1 8h) at37°C. Labelingwith Hoechst, quenching 
and washings were performed as explained in detail in uptake assay. Fluorescence spectroscopy and microscopy was 
performed on the plate. 

25 

Example 10: Effect of temperature on uptake 

[0107] Cells were seeded at a density of 1 x 10 4 cells/well in 96 well microplates. After 48 h, cells were incubated 
with 2.5 fxM of different peptide solutions in DMEM for additional 4 h at 37°C or 4°C. Labeling with Hoechst, quenching 
30 and washings were performed as explained in detail in uptake assay. Fluorescence spectroscopy and microscopy was 
performed on the plate. 

Example 11: Elucidating the mechanism of uptake 

35 [0108] In order to understand the mechanism of internalization, cells were treated with a variety of inhibitors of uptake 
like wortmannin, methyl-(3-cyclodextrin, NaN 3 /2-deoxyglucose, chloroquine, etc. Cells were cultured at a density of 1 x 
10 4 cells/well in 96 well microplates. After 48 h, cells were pre-incubated with indicated inhibitors for 30 min followed by 
additional 4h co-incubation with 2.5 \M of different peptide solutions in serum free DMEM at 37 °C or4°C. Labeling with 
Hoechst, quenching and washings were performed as mentioned in detail in uptake assay. Fluorescence spectroscopy 

^o and microscopy was performed on the plate. 
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SEQUENCE LISTING 

<110> Max-Planck-Gesellschaf t zur Forderung der Wissenschaf ten 
e.V. 

EMC microcollections GmbH 
<120> Identification of a novel cysteine-rich cell penetrating peptide 
<130> N2735 EP 
<160> 2 

<170> Patentln version 3.3 

<210> 1 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<2 21> source 

<223> /note=" Description of artificial sequence: DNA encoding the optimized CPP 
sequence" 

<400> 1 

tgtcgttggc gttggaaatg ttgtaaaaaa 30 

<210> 2 

<211> 10 

<212> PRT 

<213> artificial sequence 
<220> 

<221> source p~ 

<223> /note^'Description of artificial sequence: optimized CPP sequence" 
<220> 

<221> variation 

<222> 3 

<223> /replace= ,, c" 
<220> 

<221> variation 

<222> 6 

<223> /replace= ,l c n 
/replace^" a" 
/replace="g" 

<220> 

<221> variation 

<222> 12 

<223> /replace="c" 
/replace="a" 
/replace= n g" 

<220> 

<221> variation 

<222> 18 

<223> /replace="g" 
<220> 

<221> variation 

<222> 21 
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<223> /replace="c" 
<220> 

<221> variation 

<222> 24 

<223> /replace="c" 
<220> 

<221> variation 

<222> 27 

<223> /replace="g" 
<220> 

<221> variation 

<222> 30 

<223> /replace="g M 

<400> 2 

Cys Arg Trp Arg Trp Lys Cys Cys Lys Lys 
15 10 



Claims 

1. A nucleic acid molecule encoding a peptide capable of being internalized into a cell, wherein said nucleic acid 
molecule comprises 

(a) a nucleic acid molecule encoding a peptide having the amino acid sequence of SEQ ID NO: 2; 

(b) a nucleic acid molecule having the DNA sequence of SEQ ID NO: 1, wherein T is U if the nucleic acid 
molecule is RNA; 

(c) a nucleic acid molecule hybridizing under stringent conditions to the complementary strand of a nucleic acid 
molecule of (a) or (b), wherein the peptide encoded by said nucleic acid molecule has a cysteine at least at two 
positions selected from the group consisting of positions 1, 7 and 8 of SEQ ID NO: 2 and a positively charged 
amino acid at least at four positions selectedfrom the groups consisting of position 2, 4, 6, 9 or 1 0 of SEQ ID NO: 2; 

(d) a nucleic acid molecule encoding a peptide having at least 70% sequence identity with that of SEQ ID NO: 
2, wherein at least at two positions selected from the group consisting of positions 1 , 7 and 8 of SEQ ID NO: 2 
a cysteine is present and wherein at least at four positions selected from the groups consisting of position 2, 4, 
6, 9 or 10 of SEQ ID NO: 2 a positively charged amino acid is present; or 

(e) a nucleic acid molecule degenerate with respect to the nucleic acid molecule of (c) or (d). 

2. The nucleic acid molecule of claim 1 wherein the peptide further comprises a linker. 

3. The nucleic acid molecule of.claim 2, wherein the linker is a lysine. 

4. A vector comprising the nucleic acid molecule of any one of claims 1 to 3. 

5. A non-human host transfected or transformed with the vector of claim 4. 

6. The non-human host of claim 5 which is a cell. 

7. A method of producing a peptide comprising culturing the host of claim 6 under suitable conditions and isolating the 
peptide produced. 

8. A peptide encoded by the nucleic acid molecule of any one of claims 1 to 3 or produced by the method of claim 7. 

9. A fusion molecule comprising the peptide of claim 8. 
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10. The fusion molecule of claim 9, wherein the peptide is fused to nucleic acid, a protein or peptide, an aptamer, a 
small molecule, a nanoparticle or nanocarrier or a contrast agent. 

11. The fusion molecule of claim 9 or 10, wherein the peptide is fused to FITC, TAMRA, Gd-DOTA-, Gd-DTPA-, 64 Cu- 
5 DOTA-, 68 Ga-DOTA, siRNA, antisense oligonucleotides or nucleic acids encoding reporter genes. 

12. A pharmaceutical composition comprising the peptide of claim 8 or the fusion molecule of any one of claims 9 to 
1 1, optionally further comprising a pharmaceutically acceptable carrier, excipient and/or diluent. 

10 1 3. The composition of claim 1 2 which is a vaccine. 

14. A diagnostic composition comprising at least one of 

(a) the nucleic acid molecule of any one of claims 1 to 3, 
is (b) the vector of claim 4, 

(c) the peptide of claim 8, or 

(d) the fusion molecule of any one of claims 9 to 1 1 . 

1 5. A method of detecting the internalization behaviour of a fusion molecule comprising the peptide of claim 8 or a fusion 
20 molecule according to any one of claims 9 to 11 , comprising 

(a) administering said fusion molecule to a cell 

(b) detecting the internalization of the fusion molecule. 

25 16. A method of treating and/or preventing a condition selected from cancer, enzyme deficiency diseases, infarcts, 
cerebral ischemia, diabetes, inflammatory diseases, infections such as bacterial, viral or fungal infections, autoim- 
mune diseases such as systemic lupus erythematodes (SLE) or rheumatoid arthritis, diseases with amyloid-like 
fibrils such as Alzheimer's disease (AD) and Parkinson's disease (PD) or certain forms of myopathy comprising 
administering the composition of claim 12 or 13 to a subject in need thereof. 

30 

1 7. The peptide of claim 8, the fusion molecule of any one of claims 9 to 1 1 or the composition of claim 1 2 or 1 3 for the 
prevention and/or treatment of cancer, enzyme deficiency diseases, infarcts, cerebral ischemia, diabetes, inflam- 
matory diseases, infections such as bacterial, viral or fungal infections, autoimmune diseases such as systemic 
lupus erythematodes (SLE) or rheumatoid arthritis, diseases with amyloid-like fibrils such as Alzheimer's disease 

35 (AD) and Parkinson's disease (PD) or certain forms of myopathy. 

18. A kit comprising at least one of 

(a) the nucleic acid molecule of any one of claims 1 to 3 

(b) the vector of claim 4 

(c) the host of claim 6 

(d) the (poly)peptide of claim 8 or 

(e) the fusion protein of any one of claims 9 to 11 . 
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